Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs
Authors
Abstract
The loss functions of deep neural networks are complex and their geometric properties are not well understood. We show that the optima of these complex loss functions are in fact connected by a simple polygonal chain with only one bend, over which training and test accuracy are nearly constant. We introduce a training procedure to discover these high-accuracy pathways between modes. Inspired by this new geometric insight, we also propose a new ensembling method entitled Fast Geometric Ensembling (FGE). Using FGE we can train high-performing ensembles in the time required to train a single model. We achieve improved performance compared to the recent state-of-the-art Snapshot Ensembles on CIFAR-10 and CIFAR-100, using state-of-the-art deep residual networks. On ImageNet we improve the top-1 error rate of a pre-trained ResNet by 0.56% by running FGE for just 5 epochs.
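As a rough illustration of the "polygonal chain with only one bend" mentioned in the abstract, the sketch below parametrizes a two-segment path from one weight vector to another through a single bend point. The function name `polygonal_chain` and the toy two-dimensional weights are assumptions made for this example; in the paper the bend is found by a training procedure, whereas here it is simply picked by hand.

```python
import numpy as np


def polygonal_chain(w1, w2, theta, t):
    """Return the point at position t in [0, 1] on the path w1 -> theta -> w2.

    w1, w2 : endpoint weight vectors (the two modes).
    theta  : the single bend of the chain (hand-picked here, hypothetical).
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    if t <= 0.5:
        # First segment: linear interpolation from w1 to the bend.
        s = 2.0 * t
        return (1.0 - s) * w1 + s * theta
    # Second segment: linear interpolation from the bend to w2.
    s = 2.0 * (t - 0.5)
    return (1.0 - s) * theta + s * w2


# Toy stand-ins for two trained weight vectors and a bend point.
w1 = np.array([0.0, 0.0])
w2 = np.array([2.0, 0.0])
theta = np.array([1.0, 1.0])

# Five evenly spaced points along the chain, endpoints included.
path = [polygonal_chain(w1, w2, theta, t) for t in np.linspace(0.0, 1.0, 5)]
```

Evaluating a model's loss at each point of `path` is one way to check whether accuracy stays nearly constant along such a chain.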
Similar resources
Two-Surfaces Sliding Mode Controller for Energy Management of Electric Vehicle Based on Multi Input DC-DC Converter
In this paper, a two-surfaces sliding mode controller (TSSMC) is proposed for the voltage tracking control of a two input DC-DC converter in application of electric vehicles (EVs). The imperialist competitive algorithm (ICA) is used for tuning TSSMC parameters. The proposed controller significantly improves the transient response and disturbance rejection of the two input converters while p...
Application of Habitat Suitability and Resistance Surfaces in Assessing Habitat Changes
Habitats have been dramatically destroyed worldwide. However, a growing trend toward restoring habitats is emerging. One of the most effective approaches to revitalizing them is to restore the conditions that have been lost. Studies indicate a high probability of local extinction of the maral (Cervus elaphus maral) in its current habitats in Gilan due to severe habitat destruction. The current study aimed to introdu...
Fast and Accurate Inference with Adaptive Ensemble Prediction in Image Classification with Deep Neural Networks
Ensembling multiple predictions is a widely used technique to improve the accuracy of various machine learning tasks. In image classification tasks, for example, averaging the predictions for multiple patches extracted from the input image significantly improves accuracy. Using multiple networks trained independently to make predictions improves accuracy further. One obvious drawback of the ens...
Energy Aware Distributed Partitioning Detection and Connectivity Restoration Algorithm in Wireless Sensor Networks
Mobile sensor networks rely heavily on inter-sensor connectivity for collection of data. Nodes in these networks monitor different regions of an area of interest and collectively present a global overview of some monitored activities or phenomena. A failure of a sensor leads to loss of connectivity and may cause partitioning of the network into disjoint segments. A number of approaches have be...
Design of a Novel Framework to Control Nonlinear Affine Systems Based on Fast Terminal Sliding-Mode Controller
In this paper, a novel approach for finite-time stabilization of uncertain affine systems is proposed. In the proposed approach, a fast terminal sliding mode (FTSM) controller is designed, based on the input-output feedback linearization of the nonlinear system with considering its internal dynamics. One of the main advantages of the proposed approach is that only the outputs and external state...
Journal:
- CoRR
Volume: abs/1802.10026
Pages: -
Publication date: 2018